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Abstract. We study the long-time behaviour of the focusing cubic NLS on 
R in the Sobolev norms H s for < s < 1. We obtain polynomial growth- type 
upper bounds on the H a norms, and also limit any orbital H 3 instability of the 
ground state to polynomial growth at worst; this is a partial analogue of the 
H 1 orbital stability result of Weinstein [27], [26]. In the sequel to this paper we 
generalize this result to other nonlinear Schrodinger equations. Our arguments 
are based on the "/-method" from earlier papers [9]- [15] which pushes down 
from the energy norm, as well as an "upside-down /-method" which pushes up 
from the L? norm. 



1. Introduction 

We consider the long-time behaviour of solutions to the Cauchy problem for the 
one-dimensional focusing cubic nonlinear Schrodinger equation 

iut +u xx = -F(u); u(x,ti) = u (x) (1.1) 

where u{x, t) is a complex-valued function on R x R, F(u) is the focusing cubic 
nonlinearity F{u) :— uuu, and uq(x) lies in the Sobolev space H S (R) for some 

s e R. 

It is known ([25], [1]) that the Cauchy problem (1.1) is globally well-posed in 
H s for all 1 s > 0. Furthermore, due to the many conservation laws of (1.1), we 
know that if s is an integer and the initial data is in H s , then the H s norm stays 
bounded for all time. For instance, for s = 1 one can obtain uniform H 1 bounds 
by exploiting the conservation of the Hamiltonian 

H{u) := |Ik| 2 _Im 4 dx 
and the L 2 norm, combined with the Gagliardo-Nirenbcrg inequality. 



1991 Mathematics Subject Classification. 35Q53, 42B35, 37K10. 

Key words and phrases. Schrodinger equation, upper bound on sobolev norms, orbital stability. 
J.E.C. is supported in part by N.S.F. grant DMS 0100595 and N.S.E.R.C. grant RGPIN 250233- 

03. 

M.K. is supported in part by N.S.F. Grant DMS 9801558. 

G. S. is supported in part by N.S.F. Grant DMS 0100375 and by a grant from the Sloan 
Foundation. 

H. T. is supported in part by J.S.P.S. Grant No. 13740087. 

T.T. is a Clay Prize Fellow and is supported in part by a grant from the Packard Foundation. 
^For s < one does not even have local well-posedness, at least if one demands uniform 
continuity of the solution map; see [21] . 

31 



32 J. COLLIANDER, M. KEEL, G. STAFFILANI, H. TAKAOKA, AND T. TAO 

However when s is not an integer, the standard iteration argument only gives 
bounds on the H s norms which grow exponentially in time. For s > 1 there are 
polynomial growth bounds 2 in [24] , [3] , [8] . 

The first result of this paper is to extend these techniques to s between and 1. 

Theorem 1.1. 7/0 < s < 1 and u n E H s , then we have 

\\u(t)\\ H ,<c(\\u a \\ HS )(i + \t\r+. 

The proof of this theorem proceeds by an "upside-down" version of the "I- 
method" ([9], [11], [12], [13], [14], [20]; see also [6], [10], [19]), in which one applies 
a differentiation operator V = T>n to the solution instead of a smoothing operator 
/ = In, and proves that the quantity \\T>Nu{t) ||| is almost conserved in time. 
One should compare the result in Theorem 1.1 with the ones established in [3], 
[8] and [24] for the time asymptotic of the H s norm for smooth (s > 1) KdV and 
Schrodingcr type solutions. The method used by Bourgain in [3] is also based on an 
improved local estimate, but the improvement is not obtained by replacing the H s 
norm with a better integral (which is our argument here), but instead by using well- 
posedness results below the energy norm H 1 . In [8] and [24] the improvement of 
the local estimate is obtained by using sharp bilinear estimates in negative Sobolev 
spaces. 

We remark that the same polynomial growth result in Theorem 1.1 also holds for 
the defocusing cubic NLS (in which — \u\ 2 u is replaced by +|m| 2 m) and is slightly 
easier to obtain. It is likely that one can use the correction term techniques in 
[10], [14] to improve the exponent 2s+ substantially 3 , perhaps all the way down to 
0+. Certainly one expects to obtain an exponent which goes to as s — > 1 _ by 
exploiting conservation of the Hamiltonian. 

One can view Theorem 1.1 as a bound on the possible H s instability of the 
solution u = to (1.1). It is not strong enough to say that small H s perturbations 
to this solution at time zero remain small perturbations for all later time, but it 
limits the growth of the perturbation to polynomial growth at worst. 

The next result of this paper concerns the H s orbital stability of ground states 
for (1.1). For simplicity we shall only consider the ground states at energy 1 (the 
other energies can then be recovered by a scaling argument 4 ). It is known [7] that 
there exists a unique even positive Schwartz function Q(x) on R which solves the 



It is an open question whether one has some sort of scattering in H s for this equation, which 
would of course imply that the H s norm remains bounded. Some recent progress in this direction 
is in [23]. 

3 In the particular case of the cubic ID NLS equation (1.1), one may also be able to exploit 
the complete integrability of the equation to obtain bounds on the H s norms which are uniform 
in time, and perhaps also to obtain global stability bounds for solitons and multisolitons as well. 
On the other hand, the methods here do not exploit complete integrability and are applicable to 
a wide range of Hamiltonian evolution equations. 

4 More specifically, for every energy E > 0, there is a unique positive even Schwartz function 
Qe obeying (Qe)xx + \Qe\ 2 Qe = EQe, but these ground states arc linked by the scaling 
Qe(x) = E 1 ' 2 Q(E 1 ' 2 x). Because the equation (1.1) is L 2 -subcritical, all of these ground states 
have different L 2 mass. Since the L 2 mass is an invariant of the NLS flow, we can thus restrict 
our attention to a sphere in L 2 , in which case only one energy E is relevant. One can then use 
the scale invariance of (1.1) to set E = 1. 
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equation 5 

Qxx + \Q\ 2 Q = Q- (1.2) 

The Cauchy problem (1.1) with initial data uo — Q then has an explicit solution 
u(t) = e lt Q. More generally, for any xo G R and e %e G S 1 , the Cauchy problem 
with initial data u (x) = e lB Q{x — x ) has explicit solution e^ e+t ^Q(x — x ). If wc 
thus define the two-dimensional ground state cylinder 6 E C i/ 1 (R) by 

E := {e l6 Q{- - x ) : x G R, e w G S 1 } 

we see that the nonlinear flow (1.1) preserves E. Also note that each element of E 
obeys (1.2) (though of course most ground states are not even or positive). 

In [27] (see also [26]) Weinstein showed that the ground state cylinder S was 
H^-stable. More precisely, he showed an estimate of the form 

dist^i (u(t), E) ~ dist^i (uq, E) (1-3) 

(when dist^i (uq, E) is small), for all H 1 solutions u(t) to (1.1) and all times t G R. 
In other words, solutions which start close to a ground state in H 1 at time t = 0, 
will stay close to a ground state for all time (though the nearby ground state may 
itself vary in time 7 ). 

To prove (1.3), Weinstein employed the Lyapunov functional? 

L(u) := 2H(u) + J \u\ 2 = J \u x \ 2 + \u\ 2 - ^\u\ 4 dx, (1.4) 

which is well-defined for all u G H l . Since this quantity is a combination of the 
Hamiltonian and the L 2 norm, it is clearly an invariant of the flow (1.1). More 
explicitly, for sufficiently smooth functions u(x,t) we have the formula 

d t L(u) = 2(u t ,-u xx + u-F(u)) (1.5) 

which clearly vanishes if u solves (1.1). Here and in the sequel we use (, ) to denote 
the real inner product 

(u,v) := Re J uv dx. 

^From (1.5) and (1.2) we see that the ground states in E are critical points of L. 
In fact they are minimizers of L; more precisely, we have the fundamental coercivity 
estimate 

L(u) — L{Q) ~ dist ff i (u, E) 2 whenever u G H 1 and dist^i (u, E) <^ 1; 

(1.6) 

see [27]. The stability estimate (1.3) then follows easily from (1.6) and the conser- 
vation of L. 

5 Indeed, we have the explicit formula Q(x) := 2 -1 / 2 / cosh(a;), although wc will not use this 
formula in this paper. 

^Note that the ground state cylinder is the orbit of Q under the phase and translation invari- 
ances of NLS. We do not utilize the scaling invariance because, as mentioned earlier, this changes 
the L 2 norm of Q. Also we do not utilize Gallilean invariance because this does not preserve the 
Hamiltonian. 

7 For instance, consider the solution u(x, t) = e l (. ex — e t '>e tt Q(x — 2et) for some small e; this is a 
Gallilean transformation of the ground state solution u(t) = e lt Q, and is close to this solution at 
time zero. However at later times, the solution slowly drifts away from the original ground state 
solution, although it remains close to the ground state cylinder E. 

8 This is the functional for energy E = 1. For other energies it is given by L(u) = 2H(u) + 
JE\u\ 2 . 
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Note that the functional L is invariant under phase rotation u — > e lS u and 
translation u i— ► u(- — xo). Thus one cannot expect a coercivity bound like (1.6) 
in these directions. Of course, this is consistent with (1.6) since the ground state 
cylinder S is itself invariant under these symmetries; the point of (1.6) is that 
there are no other directions (in the tangent space of H 1 at Q) for which L can be 
invariant or concave. 

The second main result of this paper is to partially extend the H 1 orbital stability 
result to an H s orbital stability- type result for < s < 1. Unfortunately, as in 
Theorem 1.1, our estimate loses a polynomial factor in t, so we cannot exclude the 
possibility of polynomial orbital instability in H s : 

Theorem 1.2. Let < s < 1, and suppose dist#s(u , £) <C 1. Then we have 

disttf* (u(t), £) < i 1 - 8+ dist jr . (u , S) (1.7) 

whenever 1 < t -C dist#s (u , E) _1 /( 1_s+ ). In particular, u(t) stays in a bounded 
subset of H s for all times \t\ < dist H » (u , S)" 1 /( 1 -«+) . 

Note that a naive application of the local well-posedness theory would lose a 
factor of the form exp(Ci) in the estimate (1.7), so that one could only assure u(t) 
stays in a bounded subset of H s for times \t\ < log(l/distjjs (un, E)). Note that for 
times t close to dist#s(uo, S) 1 /( 1_s+ ', the right-hand side of (1.7) is close to 1, so 
that we are no longer keeping u(t) close to £. At this point one can use Theorem 
1.1 to control the further development of u(t). 

The proof of (1.7) proceeds via the smoothing operator I = In mentioned earlier. 
Basically, the idea is to show that the modified Lyapunov functional L(Iu) is almost 
conserved, and then combine this with (1.6) to obtain (1.7). It turns out that 
a naive implementation of this approach loses an epsilon power of distil (u , E) 
because the operator / does not quite preserve the ground state cylinder, but this 
can be rectified by the standard technique of choosing an approximating ground 
state to obey specially chosen orthogonality conditions. For expository reasons we 
have chosen to give the naive versions of the argument first, and only give the full 
argument at the end of the paper. 

The factor t 1 ~ s+ in (1.7) can probably be reduced, however the factor dist^a (u , S) 
on the right-hand side is necessary (as can be seen even when t ~ 1). 

In all of these arguments it is crucial that the L 2 and H 1 norms are both sub- 
critical (in the sense that they scale as a negative power of length using the natural 
scaling of (1.1)). It is because of this that we cannot extend these results to any 
other NLS with an algebraic nonlinearity 9 . In the sequel to this paper we shall 
obtain some partial results of the above type in the case when p is not an odd 
integer; the main new difficulty is to commute the I operator with non-algebraic 
nonlinearities F(u). 

2. Notation 

We use A < B to denote A < CB, where C is a constant depending on s which 
may vary from line to line. We use a+, a— to denote quantities of the form a + e, 
a — e, where e is arbitrarily small. We use (£) to denote 1 + 

^Morc specifically, the NLS equation is only L 2 subcritical when p < 1 + — , while an algebraic 
nonlinearity only occurs when p is an odd integer. Since p > 1, the only subcritical algebraic 
equation occurs when n = 1, p = 3. 
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We define the spatial Fourier transform by 

- lx ^f{x) dx 

and the spacetime Fourier transform by 

u(£,t):=[ [ e-^ +t ^u(x,t) dxdt. 
JH, JUL 

Following [2], we define the X s ' b spaces by the norm 

\\u\\ x „ b := \\u\\ s , b := \\(0 S (T-e) b u(t,T)\\ L2TLl . 

For any time interval /, we define the restricted X s ' b spaces by 

Xf :={u\ RxI :u€X*> b } 

with the usual norm 

\\v\\ x s, b :=inf{||u|| s , 6 :u| RxJ = t;}. 
We shall need the Strichartz estimate 

\\ u \\l%^ £ ll«llo,i/2+; (2-1) 

this can be obtained by writing an _Y/ 0,1 / 2+ function as an average of modulated 
free Schrodinger waves (as in [17]) and then using the lfi x t Strichartz estimate for 
free solutions (see e.g. [18] and the references therein). 

Interpolating (2.1) with the trivial bound ||u|| L 2 ^ < ||u||o,o we obtain 

IMIz4 >t < ||«||o,3/8+- (2-2) 
We also record the variant estimate 

IMLfLj % ll«llo,i/2+; (2.3) 

this can be obtained by interpolating (2.1) with the energy estimate IMIl^l^ ^ 

ll u llo,l/2+- 

^From [6] (see also [13]) we recall the improved bilinear Strichartz estimate (in 
one spatial dimension) 

IKu 2 ||^ < JV- 1 / 2 ||ui||o,i/2 + IMIo,i/2+ (2.4) 

whenever u\ has Fourier support in the region |£| ~ N, and U2 has Fourier support 
in the region |£| <C N. 

Let n > 2, and let m(£i,... , £„) be a function supported on the hyperplanc 
{£i + . . . + £„ = 0}. We use the notation 

A n (m(£i,... ,£„);/i,... ,/„) 

to denote the multilinear form 



A„(m(£ 



, ■■■ ,tn);fi,--- ,.fn) ■= / ,£ n )/i(£i)... /„(£„). 

->6+-+ln=0 
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3. Proof of Theorem 1.1 

We shall divide the proof of this theorem into several broad steps. These steps 
will also appear in the proof of Theorem 1.2. 

• Step 0. Preliminaries; introduction of the modified energy. 

Fix < s < 1, and fix the global H s solution u. Henceforth all implicit constants 
may depend on s and the quantity ||uo||i? s - O ur task is to show that 

\\u(T)\\ H s<T 2s + (3.1) 

for all T 3> 1. By the H s global well-posedness and regularity theory (sec e.g. [1]) 
and the usual limiting argument it suffices to do this for smooth, rapidly decreasing 
u. 

We now apply an "upside-down" version of the /-method; whereas the strategy 
of the I method is to mollify the solution at high frequencies to make it smoother 
(e.g. in the energy class H 1 ), here we amplify the solution at high frequencies 
instead to make it rougher (specifically, we place it in L 2 ). In contrast, the proof 
of Theorem 1.2 in later sections will proceed via the ordinary "/-method". 

Fix T, and let N ^> 1 be a large quantity depending on T to be chosen later. 
Let 0(£) be a smooth even real-valued symbol such that = 1 for |£| < N and 
e (0 = \C\ S /N S for |£| > 2N, and let V be the Fourier multiplier 

vf(0 := 0(0/(0- (3-2) 

Thus V is the identity for low frequencies |£| < N, and becomes a differentiation 
operator of order s for high frequencies |£| > N. 
Define the modified energy En (t) at time t by 

E N (t) := ||Z>u(i)||!. ( 3 - 3 ) 
^From Plancherel we have the upper bound 

En(0) < \\u \\ 2 H s < 1. 

The heart of the argument shall lie in the following almost conservation law for 
En- 

Lemma 3.1. If to € R is such that En (to) < C for some bounded constant 
C = 0(1), then we have 

E N (t + S) = E N (t ) + O(N- 1 / 2 +) (3.4) 

where S > is an absolute constant depending only on s and C. 

The error bound of 0(N~ 1 / 2+ ) might not be sharp; any improvement in this 
estimate will lead to a better polynomial growth bound than (3.1). It may be that 
one can improve this result by adding suitably chosen "correction terms" to E^(t), 
in the spirit of [10], [14]. 

• Step 1. Deduction of (3.1) from Lemma 3.1. 

If we assume Lemma 3.1, then we may iterate it to obtain Ejy(t) < 1 for all 
< t <§; N 1 / 2 ^ . In particular, if we assume T <§; N 1 / 2 " , we have from Plancherel 
and (3.3) that 

\\u(T)\\h° < N s E N (Tf/ 2 < N s . 
Optimizing N in terms of T we obtain (3.1) as desired. 

It remains to prove Lemma 3.1. This is done in several stages. 
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• Step 2. Control u at time to- 

By hypothesis we have 

||2Mto)||2<l. (3-5) 

As one can see from (3.2), this is essentially an fP-type bound on u (up to powers 
of N). However, in order to obtain good polynomial growth bounds it is important 
that we use the norm ||2?u||2 throughout rather than 

• Step 3. Control u on the time interval [to — S, to + 5]. 

We now use (3.5) to claim 



||Pu|| Y o,i/2 +E <1 (3.6) 

[*o-<Mo + s] 

for any < £ <C 1, if < <5 1 is a sufficiently small constant (depending on s and 
the bound in (3.5), but not on N). This will be achieved by techniques similar to 
those used to obtain local well-posedness using the X s ' b spaces (as in e.g. [2]). 

To begin with, by the standard energy estimate for X s - b spaces (see e.g. [2], or 
[16], page 771; note that the multiplier V commutes with the Schrodingcr operator 
id t + d xx and is thus harmless) we have 

||©u||„0,l/2+. < ||2Mt )|| L 2 +S a+ \\V(tU t +U xx )\\ 0,-1/2 + 2, . 

[to-*.'o+ s ] [*0- 5 '*0+ 5 ] 

Since we are allowed to choose 5 to be sufficiently small, it suffices from (3.5), (1.1) 
and standard continuity (or iteration) arguments to show the trilinear estimate 

||£ > («lU2ti3)||o,-l/2+2e ^ ||£>«1 ||o,l/2+ ||^"2 ||o,l/2+ ll^«3 ||o,l/2+ ■ 

We may assume that the Ui have non- negative Fourier transforms. Since w(^ + rj) < 
w(£) + w(r/) we can obtain a fractional Leibnitz rule, and reduce to showing 

WD(ui)U2U 3 \\o ,-l/2+2e ^ ll^l llo,l/2+ 11^2 ||o,l/2+ 11^3 ||o,l/2+ 
||tilX>(u 2 )ti3||o ,-l/2+2e ^ l|I ) '"l||o,l/2+||I ) 'W2||o,l/2+||^W3||o,l/2+ (3.7) 
||tilU2X > (U3)||o,-l/2+2e ^ 11^1 ||o,l/2+ \\T>U2 ||o,l/2+ \\T>Uz || ,l/2+- 

,/From the dual of (2.2) we see that ||/||o,-i/2+2e S 11/11 r^/a. The claims (3.7) then 

x.t 

follow from Holder's inequality and several applications of (2.2). This gives (3.6). 

• Step 4. Control the increment of the modified energy. 

To prove (3.4), we now apply the fundamental theorem of Calculus to write the 
increment of the modified energy (3.3) as 



fta+S 

E N (t + 5)- En (to) = / d t E N (t) dt. 

J to 



(3.8) 
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A routine integration by parts 10 shows that 
d t E N {t) = 2{Vu t ,Vu) 

= 2{iVu xx + WF(u),Vu) 
= 2{iuuu, T> 2 u) 

= -2ImA 4 (6 {£ 4 ) 2 ;u(t),^,u(t),~ultj) 

= IlmA 4 (#(a) 2 - 0(&f + 9(^) 2 - d(U) 2 ;u(t),W),u(t),^t)), 
where in the last step we exploited the symmetry 



A 4 (0(&) 2 ; «(*), u{t), u(t), u(t)) = A 4 (6^ k ) 2 ;u(t),u{t), u(i), u(*)) 

for j = 1,3 and k = 2,4. 

^.Frorn the above computations, it suffices to show the estimate 

r 4 

/ X[to,to+5]W A 4(^4^lW,W2W,"3W,"4W) &\ < N~^ 2+ J[ \\V Ui || 0> 1/2+ 

for all functions ui,u 2 ,u 3 ,u 4 € X 0,1 / 2+ , where M 4 = M4(£i, £2, £3, £4) denotes the 
symbol 

M 4 (£i, := 0(6) 2 - ^(6) 2 + ^(6) 2 - ^(6) 2 . 
We shall argue similarly to [13]. We apply Littlcwood-Paley decompositions, 
and assume that Uj is supported on the region ~ for some dyadic Ni > 1; 
of course, we will eventually have to sum in JVj to recover the general case. Define 
{soprano, alto, tenor, baritone} = {1,2, 3, 4} by requiring 

Nsoprano _^ ^alto _^ Aftenor _^ Afb ar itone • 

We may assume that N soprano ~ iVazto since A 4 is integrating over the region 
£1 + £2 + £3 + £4 = 0. We may also assume that N soprano > N since the symbol 
0(6) 2 " <?(6) 2 + <?(6) 2 - fi 1 ^) 2 vanishes otherwise. 
We divide into two cases. 

• Case (a.): N soprano ^> ^baritone* 

In this case M 4 = 0(9(N soprano ) 2 ), so we can essentially 11 bound this term by 

/rto+S 
j \ u soprano \ \^alto I Interior \ \^baritone I dxdt. 

Jto 

We use Holder's inequality to take u alto and u tenor in L x t , and u soprano u ba ritone 
in L 2 t . Using two applications of (2.2) and one application of (2.4) we can bound 
this by 

4 4 
0{N soprano fN- p l r l no J] |k|| ,l/2+ < ^Loll ll^llo,i/2+. 

i=l i=l 



10 This is of course related to the usual energy method computations to control the growth of 
higher order energies; see also [3], [24], [8]. Observe that if T> was the identity then this derivative 
would vanish, thus giving the standard proof of L? norm conservation. 

11 To be more precise, one should replace «; by Mu{, the Hardy- Littlewood maximal function 
of Uj. This is because the symbol M4 is equal to 9{N aopra no) 2 times a smooth multiplier which 
obeys good kernel estimates. We omit the details. An alternate approach is to reduce to the case 
when all the ui have non-negative spacetime Fourier transform (which is legitimate since the X s ' b 
norms do not care about the phase of Uj), however the time cutoff X[t ,t +5] (*) t nen presents an 
annoying technical difficulty (since its Fourier transform is not non- negative) . 
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The claim then follows by summing in the N{. 

• Case (b)l ^soprano ~ -^baritone' 

In this case all the frequencies are comparable. The key observation is that for 
|&| ~ N soprano and £1 + £ 2 + £3 + £4 = 0, we have 

I M4 1 0(N sopran o) 2 

~ ^2 • 



I p 2 _ f2 1 C2 _ <r 2 I ~ ^2 
SI S2 ~S3 S4 JV soprano 

To see this, we write the denominator as 

a 2 - a 2 + a 2 - a 2 = (a - axa + 6) + (6 - axa + a) 

= 2(£ 1 +60(£i+6) 

and the numerator as 

M 4 =^(a) 2 -^ 2 ) 2 +^3) 2 -^4) 2 
=^(6 + Ki + 6) + (a + a)) 2 - e(6 + (a + a)) 2 

+ 0(a) 2 - 0(6 + (a + a)) 2 ; 

the claim then follows from the double mean value theorem and the estimate 

^ 2 (QIC^W S{N S op r ano) 2 

l-pjWO )| < — p 

"S ^ soprano 

for all £ = 0(N soprano ). 

To use (3.9) we must use the spacetime Fourier transform, which requires us 
to first modify the cutoff X[t ,*o+<5] (*)• Write X[t ,t +5](t) = a(t) + b(t), where a(t) 
is X[t ,t +5](t) convolved with a smooth approximation to the identity of width 
N-pZno, and b(t) = X[t ,t +S](t) - a(t). 

Consider the contribution of b(t). This term is essentially bounded by 

e{N sopranof j J |6(t) | |ui (t) 1 1 U 2 (t) 1 1 U 3 (t) 1 1 U 4 (t) \ dxdt; 

by Holder's inequality and (2.3) we may bound this by 

4 

N sop°rano Y\_ ll U illo,l/2+ 
i=l 

which easily sums to be acceptable. 

Now consider the contribution of a(i). In light of the estimate 

H*)wi||o,l/2+ < ^°ironoll u l|lo,l/2+ 

(see [13]) it suffices to show 

r 4 

/ A4(M 4 ;ui(t),U2(t),«3(t),U4(t)) dt< N^. ano J] ll^llo,i/2+. 

i=l 



40 



J. COLLIANDER, M. KEEL, G. STAFFILANI, H. TAKAOKA, AND T. TAO 



We may assume that the spacetime Fourier transforms of Ui are non-negative. By 
Plancherel and (3.9) it suffices to show that 



/ / \&- £2 +d - e4 2 |wi(a,n)^(6,T-2)M 3 (6^3)^4"(e4,T4) 

" , Ci+e2+C 3 +?4=0 Jri+T 2 +r 3 +r 4 =0 

4 

< N soprano J^J ||o,l/2+- 
i=l 

^From the triangle inequality we have 

\& - e 2 2 + 4 2 - e 2 i < in - e 2 i + \r 2 + e 2 \ + \rs -e 3 \ + ir 4 + e,\-, 

interpolating this with the trivial bound of N^ oprano we obtain 
By symmetry, it thus suffices to show 

1/ / |Tl -^i| 1/2 Ui(^i,Ti)ui(^ 2 ,r 2 )u3(C3,T3)?Il(C4,T4)| 

"'ei+«2+C3+e4=0 J T 1 +T 2 +T 3 +T i =0 

4 

By Plancherel and Holder's inequality the left-hand side is bounded by 

ll^Cn -Cl| 1/2 Wl)||z,2i,2||M2||LfZ,6||w3||LfZ,6||w4||LfL6 

which is acceptable by (2.1) and the definition of X ^^ 2+ . This completes the proof 
of Lemma 3.1, and hence of Theorem 1.1. 

4. Proof of Theorem 1.2: first attempt 

We now prove Theorem 1.2. We shall begin by giving a simplified version of 
the argument which does not capture the full power of dist#s(wo,£) in (1.7), but 
introduces the main ideas and establishes the key multi-linear estimates (Lemma 
4.2). In the next section, we shall give a better version of this argument which 
recovers most of this power in the next section, and then in the section after that 
we shall give the most refined version of the argument that gives (1.7) with no loss. 

This argument is very similar to the previous one, and we have deliberately given 
the two arguments a nearly identical structure. 

• Step 0. Preliminaries; introduction of the modified energy. 

By the existing global well-posedness theory and a standard limiting argument 
we may assume that it is a global smooth solution which is rapidly decreasing in 
space. 

Fix < s < 1, u, and let N ^> 1 be chosen later. Let m(£) be a smooth even 
real-valued symbol such that m(£) = 1 for |£| < N and m(£) = l^^/TV 5-1 for 
£| > 27V, and let / be the Fourier multiplier 

Tf(0 :=m(0/(0- 

Thus / is the identity for frequencies |£| <C N and is smoothing of order 1 — s for 
high frequencies |£| > N. 
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In analogy with the quantity (3.3) used in the proof of Theorem 1.1, we define 
the modified energy 12 £7jv(i) by 

E N (t) := L(Iu(t)), (4.1) 

where the Lyapunov functional L() was defined in (1.4). 

We now estimate En(0). Write a := dist#s (u , E), thus by hypothesis < a <C 1 
and there exists a ground state Q G £ such that 

||«o - Q\\h" < o-. 

Applying /, we see that 

\\Iu -IQ\\ m <N^ s a. 

On the other hand, since Q is smooth, its Fourier transform is rapidly decreasing, 
and 

\\IQ-Q\\hi<N- c (4.2) 

for any C. Thus, if we assume 

N>a°~, (4.3) 

we then have 

||/«o - Q\\m < N 1 -^. 

By (1.6) we thus have 

E N (0)-L(Q)<N 2 - 2s a 2 . 
We shall make the assumption that 

N 2-2 S(J 2 < L ( 4 4 ) 

The heart of the argument shall lie in the following almost conservation law for 
the modified energy En. 

Lemma 4.1. If to € R is such that Ejy(to) — L(Q) <C 1, then we have 

E N (t + S)=E N (t Q ) + O(N- 1+ ) (4.5) 
where 8 > is an absolute constant depending only on s. 

As in the previous section, we do not know if the error bound 0(N~ 1+ ) is sharp; 
any improvement in this error estimate will ultimately lead to an improvement of 
the t 1 ~ s+ factor in (1.7) (if one uses the most refined version of the argument below, 
see Section 6). 

• Step 1. Deduction of a weak form of (1.7) from Lemma 4.1. 

If we assume Lemma 4.1, then we may iterate it to obtain Sjv(t) — L(Q) < 
N 2 - 2s a 2 for all 

1 < f <C N 1 ^ N 2 ~ 2s a 2 , (4.6) 

where we assume 

N 1- N 2-2 S(J 2 > 1 ( 4 ? ) 

(note that this automatically implies (4.3)). 



12 In a more standard application of the /-method, e.g. [9], [11], [12], [20]) we would take 
^Jv(*) = H(Iu(t)). The main difference here is thus to add an L 2 component to the modified 
energy in order to make the ground state cylinder an approximate minimizer of the energy. 
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Fix t as above. Then by (1.6) we have 13 

\\Iu{t)-Q{t)\\ m <N l -'a 

for some ground state Q(t) G E depending on t. Using (4.3) as before, we may 
modify this to 

\\Iu(t) - IQ(t)\\ m < N^'a 

which implies that 

\\u(t) - Q(t)\\ H s 

and hence that 

distil («(*), E) < A^V. (4.8) 
Optimizing in A^ subject to (4.4), (4.6), (4.7) we obtain 

distil {u(t), E) < £3^0-3^ (4.9) 

whenever t <C er -1 ^ 1-15 * 1- , which is a weak form of (1.7). In the next two sections 
we shall use more refined arguments to improve this bound. 

It remains to prove Lemma 4.1. This shall be done in several stages. 

• Step 2. Control u at time to. 

,/From the hypothesis Stv(^o) — L(Q) <C 1 and (1.6) (or the Gagliardo-Nirenberg 
inequality) we have 

||/u(to)||*i <1- (4-10) 

Up to powers of N, this is basically an H s bound on u(to), but to obtain good 
exponents it is important that we work with the ||Ju||#i norm rather than the 

• Step 3. Control u on the time interval [to — S, to + 6]. 

We now claim that (4.10) implies the spacetime estimate 

||Ju|| x .i,i /a+ . <1 (4.11) 

[*o-<Mo+ a l 

if < 5 <C 1 is a sufficiently small constant and < e <^ 1. As before, it suffices to 
show the estimate 

\\I(uiU2U 3 )\\l ,-1/2+e ^ ll^llll, l/2+||-fM2||l,l/2+||-f«3||l, 1/2+- (4-12) 

We can write the left-hand side as 

||/(V)(uiti2U3)||o ,-1/2+e) 

where /(V) is the multiplier with symbol m(^)(^). Since m(^+rj)(^+rj) < rn(^)(^) + 
771(77) (77), we can use the fractional Leibnitz rule, together with (2.2) and its dual, 
to obtain this estimate. 

• Step 4. Control the increment of the modified energy. 



Strictly speaking, to apply (1.6) we must assume beforehand that Iu(t) is close to a ground 
state. But by hypothesis this is true at time t = 0, and so one may proceed by a standard 
continuity argument which we omit. This continuity argument will also be used later in the paper 
without further mention. 
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Now that we have obtained (4.11), the next step is again the fundamental the- 
orem of Calculus (3.8). We introduce the nonlinear functional Q(v), defined on 
smooth (H 2 will do) functions v on R x R by the formula 

rto+6 

fl(v) := / (il(v(t) xx + F(v(t))) , -Iv{t) xx + Iv(t) - F(Iv(t))) dt. 

Jt ° (4.13) 

^Frorn (3.8), (1.5) and (1.1) we have 

E N {t + 6)-E N {t ) = 2fl{u), (4.14) 

so in view of (4.11) it suffices to prove the following estimate, which we shall re-use 
in later sections. 

Lemma 4.2. We have 

\n(v)\<N-i+ 

whenever 

\\Iv\\ x l.l/l+e <1. 

We now prove Lemma 4.2. By repeated integration by parts and a symmetriza- 
tion (cf. the computations after (3.8)) we can expand the integrand in (4.13) as 

(ilv xx , -F(Iv)) + (UF(v), -Iv xx +Iv- F(Iv)) 

= (ilv xx , -F(Iv) + IF(v)) + (iF{v),I 2 v) + (iIF(v), -F(Iv)} 

=Re(zA 4 (-£ 1 m(£i)(~m(£2)m(£ 3 )m(£4) + m(£ 2 + 6 + &)); v,v, v, v) 

+ zA 4 (m(£ 4 ) 2 , v, v, v,v) 

-iA 6 (m(£i +6 +&)rn(£ 4 )rn(£ 5 )m(£ 6 )-,v,v,v,v,v,v)) 
=Rc(CiA 4 (M4;w,TJ,w,iJ) + C 2 A 4 (M^;v,v,v,v) + C z k 6 {M & ;v,v,v,v,v,v)) 
where C\ , C 2 , C 3 are imaginary constants and 



M'l 

Mr 



' 2 "ii\ 

= m(£ 1 ) 2 -m({ 2 ) 2 +m(£ 3 ) 2 -m(U) 2 

= m(£i + 6 + €3)m(€4,)m(€ 5 )m(€ 6 ) - w(6)w(^ 2 )m(^ 3 )m($ 4 + £ 5 + £ 6 ). 



It thus suffices to show the bounds 

fta+S 



/ A4(Mi;«i(t),« 2 (t),« 3 (t),tt4(t)) <ft| < JV- 1 JJ||/« i || li i /2+ 

i=i (4.1 

rto+S 4 

/ A 4 (M; ;Ul (t),u 2 (t), U3 (t),u 4 (t)) dt\ < N- 1 JJ 11/^111,1/2+ 

i=i (4.1 



(4.16) 

/■*o+<5 6 

| / A 6 (M 6 ;ui(t),u 2 (t),u 3 (i),tI 4 (i),«5(i), : W6(i)) &| < JVT 1 JJ ||Ju i || 1 ,i /2+ . 

i=i (4.17) 

As before, we first restrict the Fourier support of Ui to the region (&) ~ A/j for 
some Ni>l, and promise to sum in the A 7 * later. 

• Step 4(a): Proof of (4.15). 
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We first prove (4.15). We shall assume that N2 > N3 > N4; the other cases are 
similar. We may then assume N2 > N since the symbol vanishes otherwise. 
We split into two cases: N2 ^> A3 and N2 ~ A3. 

• Case 4(a). 1: N 2 > N 3 . 

We may assume that Ni ~ N 2 since £1 +£2 +£3+^4 = 0. We then bound the sym- 
bol M' A by N^m(N 1 )m(N 2 ) ~ N 1 m(N 1 )N 2 m(N 2 ), and estimate this contribution 

by 

N 1 m(N 1 )N 2 m(N2) f /|ui||w 2 |K|K|. 



Applying Cauchy-Schwartz followed by two applications of (2.4) we obtain a bound 
of 

4 4 
Nl /2 m{N l )N 1 2 /2 m{N 2 ) J] IKIIo,i/2+ £ N~ 1/2 N 2 ~ 1/2 [J ||/ Ui ||i,i/ 2+ . 

i=l i=l 

Summing in the Ni we see that this case is acceptable (we lose some logarithms of 
Ni and N2 from the A^ and A^ summation, which is why we only end up with a 
bound of N~ 1+ instead of A^ 1 ). 

• Case 4(a). 2: N 2 ~ A3. 

Since (£2 + £3 + £4} ~ Ni, we may bound the symbol M' A by N^m(Ni) 2 , and 
estimate the contribution by 



nMNi) 2 J J MMKIKI- 

We bound Nfm(N 1 ) 2 < A r im(A r i)A r 2 m(A r 2 ) and apply (2.2) four times to bound 
this by 

4 4 

N 1 m(N 1 )N2m(N2) J] ||«i||o,i/2+ £ ^ m(JV ) II ll J «illi,i/2+- 

i— 1 3 V 3^ 

Summing in the A^ we see that this case is acceptable (again, the Ni and N4 
summations cost us some logarithms). 

• Step 4(b): Proof of (4.16) and (4.17). 

Now we prove (4.16), which is rather easy due to the lack of derivatives in the 
symbol. We may assume that N\ > N2 7 N 3 ,N 4 . We may assume that Ni > N 
since the symbol vanishes otherwise. Then if we bound the symbol M'l by O(l) 
and use (2.2) we estimate this term by 

4 1 4 

n kho,i/2+ % N , N) n n ju iiii,i/2+- 

i=l ^ 1 i=l 

Summing in the Ni we see that this case is acceptable, again losing some logarithms 
in the A2, A3, N4 summations. (Indeed, it is clear one could extract far more decay 
from this term if desired). Finally, the estimate (4.17) is similar to (4.16) (just use 
(2.1) instead of (2.2)). 

This concludes the proof of Lemma 4.2, hence of Lemma 4.1, which gives a weak 
version of Theorem 1.2. 
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5. Proof of Theorem 1.2: second attempt 

In the previous section we gave a partial proof of Theorem 1.2, but with the 
wrong power of distjjs (uq, £). The problem was that we were not really exploiting 
the fact that u(t) was close to the ground state cylinder S; for instance, the bound 
(4.10) would still be true if u was at a distance ~ 1 from the ground state cylinder 
in the metric. To improve upon these results we must consider not just 

u(t), but also the difference w(t) := u(t) — Q(t) between u and an appropriate 
ground state Q(t) S S. In particular we wish to exploit the fact that w has small 
norm (with a bound which depends linearly on a). 

In this section we use the above ideas to refine the argument of the previous 
section, and obtain a near miss to (1.7). Unfortunately our power of distjys (u , S) 
will still be off by an epsilon, mainly because the smoothing operator / does not 
quite preserve the ground state cylinder (so that IQ{t) is not a ground state). 
To fix this problem and get the sharp power of dist/fs (u , E) requires some further 
refinements which wc delay until the next section in order to simplify the exposition. 

We now turn to our second attempt at proving Theorem 1.2, deliberately re- 
peating much of the structure of the arguments from previous sections. 

• Step 0. Preliminaries; introduction of the modified energy. 

We make the same reductions as the previous section, and leave the definition of 
the modified energy Ejy(t) from (4.1) unchanged. The main difference is that we 
sharpen Lemma 4.1 to 

Lemma 5.1. J/( eR is such that E N (t Q ) < L(Q) + a 2 for some N~ c < a <C 1 
for some arbitrary constant C, then we have 

E N (t Q + 5)=E N (t Q ) + 0(N- 1+ & 2 ) (5.1) 

where S > is an absolute constant depending only on s. 

As in the previous section, we do not know if the factor 0(N~ 1+ ) can be im- 
proved. The quadratic exponent a 2 is probably sharp, however, since the derivative 
of -E/v(^o) w ih contain terms which are quadratic in the difference w(t) = u(t)—Q(t). 

• Step 1. Deduction of a slightly weakened form of (1.7) from Lemma 
5.1. 

Assume Lemma 5.1 for the moment. We substitute this lemma, with a ~ N 1 ~ s cr, 
in place of Lemma 4.1 in Step 1 of the previous section; note that a is admissible 
by (4.3). The argument then proceeds as before, with (4.6) replaced by 

1 < t < N 

(and (4.7) is no longer needed). For this range of t we obtain (4.8). Optimizing in 
N subject to (4.3), (4.4), (4.6) we find that we may improve (4.9) to 

dist^ (u(t), E) < t^+a 1 - (5.2) 

whenever 

t < ct^t^ - . 

This is within an epsilon of (1.7). To remove this last epsilon we shall need a more 
refined argument, presented in the next section. 

It remains to prove Lemma 5.1. This shall be done in the usual sequence of 
stages. The main difference is that, instead of controlling u, we shall control a 
difference w(t) = u(t) — Q(t) between u(t) and a suitably chosen ground state Q(t). 
This will let us recover the powers of a in (5.1). 
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• Step 2. Control w at time to- 

Fix to- By (1.6) there exists a ground state Qt 6 £ depending on to such that 

\\iu{t ) - Q to \\m 

The above estimate asserts that it (to) is in some sense close to Qt . If u(to) was 
in fact equal to Qt , then the evolution of u(t) would follow the curve of ground 
states 14 Q(t) : R -> S defined by 

Q(t):=e^-^Q t0 . (5.3) 

Thus it is natural to define w(t) := u(i) — Q(t). By (4.2) and the previous we have 

IIM*o)ll^ (5.4) 

where we have used the hypothesis that a > N for some C . 

• Step 3. Control w on the time interval [to — 5, to + 5]. 

The idea is now to run a local well posedness argument for w instead of u in 
order to gain the powers of u in (5.1). Specifically we wish to obtain the spacetime 
estimate 

||Jio|| v i,i/2+£ < a (5.5) 

for some small absolute constants < S <C 1 and < e <C 1. 

To obtain this, we observe from (1.1) and (1.2) that w obeys the difference 
equation 

iwt + w xx = -G(w(t),Q(t)) (5.6) 

where G is the nonlinear expression 

G(w(t),Q(t)) := F(Q(t) + w(t)) - F(Q(t)). 

The exact form of G is not important, save for the fact that G is cubic in Q(t), 
Q(t), w, w, and is always at least linear in w, w. By (5.4), (5.6) and the standard 
X s ' b energy estimate (as in previous sections), we thus have 

\\Iw\\ x ^ + c <a + <5°+||/GH|| xl ,- 1/2+2e . 

[t a -S,t + S] ^{t -5,t +S] 

By several applications of (4.12), and the observation that Q(t), Q(t) are Schwartz 
in x and thus live in every (time-localized) X s ' b space, we therefore have 

\\Iw\\ xl , 1/2+ e <or + i°+||J«,|| lll/2+e +S a+ \\Iw\\ xl , 1/2+E . 

The claim (5.5) then follows if 5 is small enough by standard continuity (or iteration) 
arguments. 

• Step 4. Control the increment of the modified energy. 

We now prove (5.1). By (4.14) it suffices to show that 

fl(Q(t)+w(t)) = 0(N- 1+ a 2 ). 

We could do this by direct computation, but we present instead a simple argument 
(based on isolating the terms in Q(Q(t) +w(t)) which are linear in w) which obtains 
this bound as a nearly automatic consequence of Lemma 4.2. 

This heuristic is a little inaccurate because of the presence of the I, and also because the 
error Iu(to) — Qt a can affect the modulation parameters of the approximating ground state. We 
address these issues in the next section, when we remove the epsilon losses which arise from the 
arguments in this section. 
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To prove the above estimate it suffices by the hypothesis a > N c to prove the 
more general bound 

fi(Q(i) + k^-) = 0(iV- 1+ |fc| 2 ) + OiN- '- 1 ^) (5.7) 
a 

for any real number k such that |fc| < 1. 

To prove (5.7), first observe from (4.13) that the left-hand side of (5.7) is a 
polynomial P(k) in k of degree at most 6 (with the coefficients depending on Q(t), 
w(t), a, of course). From (4.13), (1.2), and integration by parts we see that the 
constant term of P is zero. Also, from Lemma 4.2 and (5.5) we see that the left- 
hand side of (5.7) is 0(N- 1+ ) for all |fc| < 1. Thus all the coefficients of P{k) are 
0(N~ 1+ ). To finish the argument it suffices to show that the linear term of P(k) 
is 0(7V~ C ~ 1 ). Equivalently, it suffices to show that the portion of fl(Q(t) + w(tj) 
which is linear in w, W is 0{N^ c ^ 1 n). 

To show this we return to (4.14), and exploit the fact that IQ(t) is nearly a 
minimizer of L. If we compute Eff(t) = L(IQ(t) + Iw(t)) using (1.4) and suppress 
terms which are not linear in w, w, we obtain 

E N {t) =2{IQ(t) x , Iw{t) x ) + 2(IQ(t), Iw(t)) 

- 2(F(IQ(i)),Iw(t)) + nonlinear terms 

Integrating by parts and using (1.2), we obtain 

EN(t) = 2(w(t), R(t)) + nonlinear terms 

where 

R(t) := I(IF(Q(t)) - F(IQ(t))). 

Differentiating this in time using (5.6), and again suppressing nonlinear terms, we 
obtain 

d t E N (t) =2(iw xx (t) + 2iw(t)Q(TjQ{t) + iQ(t)w(t)Q(t),R(t)) 
+ 2(w(t),d t R(t)) + nonlinear terms 

We can bound the linear terms by (for instance) 

< ||^)ll2(||i2(t)||^+||W)||2). 

Because Q(t) is Schwartz, the operator I is almost the identity on Q(t) or F(Q(t)), 
and so it is easy to see that 15 

11^)11^ + || W)|| 2 < N- c -\ 

Also from (5.5) we have ||u;(i)||2 < a- Thus the linear part of Q(Q(t) + w(t)) is 
0(N~ c ~ 1 a) as desired. This proves (5.1), which then almost gives (1.7). 

-^Note that the time derivative dt on R(t) is not dangerous because the time-dependence of 
Q(t) and R(t) are given by a simple phase rotation e %t . In the next section however we shall need 
a more sophisticated time evolution for the approximating ground state Q(t). 
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6. Proof of Theorem 1.2: final argument 

In the previous sections one had to assume (4.3) in order to use the approximation 
Q s=s IQ. Ultimately this assumption caused us to miss (1.7) by an epsilon. In order 
to avoid this loss we shall need to avoid using the approximation Q w IQ, at least 
in situations in which one does not have enough powers of a in the estimates. 

• Step 0. Preliminaries; introduction of the modified energy. 

This step is the same as in the previous section, except that we need to replace 
the modified energy L(Iu) from (4.1) by a slightly different quantity (because IQ 
is not quite a minimizcr of L). 

To motivate the argument we first recall the more precise statement of Wein- 
stein's estimate (1.6): 

Lemma 6.1. [27], [26] Let Q e £ be a ground state, and let w 6 H 1 obey the two 
orthogonality conditions 16 

(w, AF{Q)) = for A = i,d x 
Then, if the H 1 norm of w is sufficiently small, we have the coercivity estimate 
L(Q + w)- L(Q) = L(Q + w)- L(Q) ~ \\w\\ 2 m . 

Note that the two anti-selfadjoint operators A = i,d x are the infinitesimal gen- 
erators of the phase rotation and translation groups respectively, both of which 
preserve the ground state cylinder S. 

We shall need to apply this lemma to estimate quantities of the form L(Q(t) + 
Iw(t)) (which will be our substitute for L{Iu)). Thus we shall require w to obey 
the orthogonality conditions 17 

(w{t),AIF(Q(t))) =0 for A = i, d x (6.1) 

for all times t. 

To use Lemma 6.1 we thus need a decomposition u — Q(t) + w(t) of the solution 
u which obeys (6.1). This is the purpose of the following Lemma. 

Lemma 6.2. Ifu 6 H S (R) is such that dist# s (u, S) <^ N 11 " 1 , and N is sufficiently 
large depending on s, then we may decompose u = Q + w, where Q £ X is a ground 
state, w obeys the orthogonality conditions 

(w,AIF(Q)) =0 for A = i,d x (6.2) 

and the bound 

\\Iw\\& < A^-Mist^KE). (6.3) 
Proof Define the metric d on H s by 

d(u,v) := ||/(« - v)\\ H i = ((I(u - v),I(u - v)) + (d x I(u - v),d x I(u - v))) 1/2 . 
Clearly we have 

d(u, S) < A^-Mistjfa (u, S) < 1. 

We now claim that there exists a ground state Q' in E which minimizes d(u,Q'), 
so that d(u,Q') = d(u,T,). To see this, first observe (since u was assumed to be 

16 If 

one writes u := Q + w, then these orthogonality conditions have the geometric interpreta- 
tion that Q is the closest ground state in E to u, as measured in H 1 norm. 

17 This trick of choosing Q(t) to obey carefully selected orthogonality conditions is very common 
in stability analysis, see e.g. [22]. 
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smooth and rapidly decreasing) that (Iu, Ie l8 Q(x — xo)) — > as xo — ► ±00, and 
similarly for (d x Iu, d x Ie 'Q(x — xo)). By orthogonality we thus have 

d(u, e t6 Q{x - x )) -► {\\Iu\\ 2 m + \\IQ\\ 2 H1 )^ 2 > 1 » d(u, E) 

as Xo — ► ±00. Thus, in order to minimize d(u, Q') on E, it sulRces to restrict Q' to 
a compact subset of the cylinder E. By compactness and smoothness we then see 
that a minimizer Q' must exist. 

Observe that the statement and conclusion of Lemma 6.2 is invariant under 
translations and modulations of u (and hence of Q and w). By these invariances 
we may thus assume that the minimizer Q' is attained at Q' = Q, thus 

d(u, Q) = d{u, E) < N l - s dist Hs (u, E) < 1. (6.4) 

The tangent space of E at Q is spanned by iQ and Qa;. Differentiating 

d(u, Q f = (I(u - Q),I(u - Q)) + (d x I(u - Q), d x I(u - Q)> 

in these directions, we thus see that 

(I(u - Q), HQ) + (d x I(u - Q), HQ) = 0; 
(I(u - Q),IQ X ) + (d x I(u - Q),d x IQ x ) = 0. 

Integrating by parts and using (1.2) we obtain 

(w,AI 2 F(Q)} =0 for A = i,d x (6.5) 

where w := u — Q. This is almost what we want, except that we have I 2 instead of 
/. To rectify this we shall need to use perturbation theory to shift and modulate the 
ground state slightly. In other words, we set Q := e t6 Q(x — xo) for some \6\, |xo| <1 
to be chosen later. Writing q := Q — Q and 

w := u — Q = w — q, 

we see from (6.5) that (6.2) becomes 

(w - q, AIF(Q + q)) - (w, AI 2 F{Q)) = for A = i, 8 X . 

We rearrange this as 

(q, AIF(Q + q)) - (w, AI(F(Q + q) - F(Q))) = (Iw, A(l - I)F(Q)) for A = i, 8 X . 

(6.6) 

Since F(Q) is Schwartz, the right-hand side is 0(N- lm \\Iw\\ H i) = 0(N-"dist H s (u, E)) 
by (6.4). Now we expand the left-hand side to first order in 8, £ - Observe that 

q = Q-Q = 9iQ- x a Q x + H 2(\8\ 2 + |x | 2 ) 

where H 2(X) denotes a quantity whose H 2 norm is O(X). Similarly we have 

F(Q + q)- F(Q) =9G + x Q H + O m {\9\ 2 + |x | 2 ) 

where G, H are explicit Schwartz functions whose exact form is not important for 
us. Thus (6.6) becomes 

0(iQ, AIF(Q)) - x Q (Q x , AIF(Q)) - 6{w, AIG) - x Q {w, AIH) 

= 0(N-"dist s H (u, E)) + 0{\9\ 2 + |x | 2 ) for A = i, 8 X , 
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which we can write as a matrix system 

( (iQ,UF(Q))-(w,UG) -(Q x iIF(Q)) - (w,UH) \(0 \ 
^ (iQ,d x IF(Q)) - (w,8 x IG) -(Q x d x IF(Q)) - (w,d x IH) J \ x J 

= 0(N-"dist s H (u, £)) + 0(\6\ 2 + \x \ 2 ). (6-7) 

Since G and H are Schwartz, we have 

(w, AIG), (w, AIH) = 0{\\lw\\ m ) = 0(d(u, Q)) -C 1 for A = i, d x 

by (6.4). Also, an easy integration by parts using (4.2) gives the estimates 

(tQ, l IF(Q)) = \\Q\\i + O(N- W0 ) 

(Q x , iIF(Q)}, (iQ, d x IF(Q)) = O(N- W0 ) 

{Q x ,d x IF(Q))=3 J Q 2 Ql + O(N- W0 ). 

Thus if N is large enough, the matrix in (6.7) has a non-degenerate Jacobian at 
the point (9, Xo) = (0,0). By the inverse function theorem (or the contraction 
mapping theorem) we can thus solve (6.6) for some #,x = 0(./V~ 99 dist#s (u, £)). 
The condition (6.3) then follows from (6.4) (since the distance between Q and Q is 
0{\6\ + \x \) = 0(N~" distil (u, S)) in any reasonable norm). ■ 

Applying this Lemma at each time t we thus have ground states Q(t) and a 
function w(x,t) such that u(t) — Q(t) + w(t) and the orthogonality relations (6.1) 
hold for all times t for which distil (u, S) -c iV s_1 . (We can ignore this latter 
condition by standard continuity arguments, since we will eventually verify this 
hypothesis (with some room to spare) at the end of the argument). 

We now redefine the modified energy E N (t) as 

E N (t):=L{Q(t) + Iw(t)). (6.8) 

This should be compared to the energy L(IQ(t) + Iw(t)) from (4.1) used in the 
previous section. From Lemma 6.1 we have the analogue of (1.6) 

E N {t)-L{Q)~ \\Iw(t)\\ 2 H1 (6.9) 

whenever ||/w(i)|| ff i is sufficiently small. In particular, at time zero we see from 
Lemma 6.2 that 

^(0)-L(O)^||M0)|| 2 ffl <iv 2 - 2 V. 

The analogue of Lemma 5.1 is 

Lemma 6.3. If to E R is such that Sjv(io) < L(Q) + cr 2 for some < a -C 1, then 
(5.1) holds for some S > which is an absolute constant depending only on s. 

• Step 1. Deduction of (1.7) from Lemma 6.3. 

We repeat Step 1 of the previous section, using (6.9) as a substitute for (1.6). 
The main differences are that there is no hypothesis of the form u > N~ c in Lemma 
6.3, so that there is no need to assume (4.3). We omit the details. 

Note that while there is some logarithmic losses in the 0(N~ 1+ ) factor of (5.1), 
there are none in the a 2 factor. This is what allows us to get the sharp power of 
distil (uo, S) in (1.7), although the power t 1 ~ s+ is probably not sharp. 

It remains to prove Lemma 6.3, which we divide into the now-familiar sequence 
of steps. 

• Step 2. Control w at time to. 
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We now prove Lemma 6.3. Fix to- By (6.9) we again obtain (5.4). 

• Step 3. Control w on the time interval [to — S, t + 6]. 

As before, the next step is to obtain (5.5). It will be convenient to write Q(t) 
more explicitly as 

Q{x, t) =: e i0 ^e u Q{x - x (t)) (6.10) 

where the modulation parameters 6(t), xo(t) are real-valued; we shall shortly derive 
equations for the evolution of these parameters. This ansatz should be compared 
with (5.3); note that the standard ground state u(t) = e lt Q occurs when 6,x ,w 
all identically vanish. 

^From (1.1), (1-2), (6.10) we see that w obeys the difference equation 

iwt + w xx = -G(w(t), Q(t)) - Q(t) - iQ{t) t 

(6.11) 

= -G(w{t), Q(t)) + 9(t)Q(t) + ix (t)d x Q(t) 

where G was defined in the previous section. We may thus repeat Step 3 of the 
previous section, provided that we can show the estimate 



\\6{t)Q(t) + ix {t)d x Q{t)\\ x i.-i/2+e < \\Iw 



1,1/2 + 



X '~ ' /-^j II II X 

^[t -S,t + S] ^[t -5,t + 5] 

Since Q(t) is Schwartz, it suffices to show that 

\9(t)l\io(t)\<\Ht)\\Hs<\\Iw(t)\\ Hl . (6.12) 

To do this, we argue in a manner reminiscent of the proof of Lemma 6.2. We 
first introduce the renormalized function w(t) defined by 

w(x, t) e- l8{t) e- lt w{x + x (t)). 

^From (6.11) and (6.10) we see that w evolves according to the equation 

iw t + w xx = -G(w, Q) + 6(t){Q + w)+w + x (t)d x (Q + w). 

(6.13) 

On the other hand, from (6.1) we have the orthogonality relations 

(w(t),A j )=0iorj = 0,l, 

where A , A\ are the Schwartz functions 

A := I(iQ 3 ); A 1 := Id x (Q 3 ). 

Let j = 0, 1. Differentiating the previous in time and then applying (6.13) we 
obtain 

-{iw xx ,Aj) = -e\t){i{Q + w),A J ) + {iw,A j )+xo{t){d x {Q + w)A J )-{iG{w,Q),A 1 ) 

which we rewrite as 

{i{Q + w),A 3 )e(t) - (d x (Q + w),Aj)xo(t) = -(w,i{Aj) xx ) + (G(w,Q),iA j ) 

This is a linear system of two equations j = 0, 1 and two unknowns 8(t), x (t). To 
invert this system we first observe that the right hand side is 0(||w||^») since the 
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Aj are Schwartz. Also, since IQ — Q has norm O(7V~ 10 °) in any reasonable space, 
we have the coefficient estimates 

(*(Q + w), A ) = {iQ, iQ 3 ) + (i(IQ - Q), iQ 3 ) + 0(\\w\\ H ') 

= \\Q\\i + 0(N-^) + 0(\\w\\ Hs ) 

(i(Q + w),Ai) = (iw, A x ) + 0{N- WO ) 

= O(\\w\\ Hs ) + O(N- 10Q ) 

(d x (Q + w),A ) = -{w, d x A Q ) + O(N- W0 ) 

= 0(\\w\\ H ,) + 0(N- im ) 

(d x (Q + tD),Ai> - (d x Q, d x {Q 3 )) + (d x (IQ - Q), d x (Q 3 ) - (w, d x A 1 ) 

= -(Q xx , 3 ) + 0(N- im ) + 0(||iD|| ff .)- 

Observe that the absolute constants \\Q\\\ and —(Q XX ,Q 3 ) =3/ Q 2 Q X are both 
non-zero. Thus if N is sufficiently large and ||ti)||jT« = is sufficiently small, 

we can invert the above 2x2 linear system and obtain the desired bounds (6.12). 
Note that this argument shows that one can make 9 it) and xo(i) differentiable in t. 
This completes the proof of (5.5). In particular we have the bounds 

\\w(t)\\Hs < \\Iw(t)\\ H i <a (6.14) 

on the interval [to — S, t a + S], so from (6.12) we have 

|<?(*)|,|io(*)|<*. (6-15) 

• Step 4. Control the increment of the modified energy. 

We now prove (5.1) again (or more precisely, we prove (5.1) for the new definition 
(6.8) of the modified energy). We write 

E N (t) = Lie- lt Qit) + e- u Iw(t)). 

We now use (1.5) to obtain 

d t E N it) = 2{Q(t) t -iQ(t)+Iwt-iIw, -d xx iQit)+Iw)+iQit)+Iw)-FiQit)+Iw)). 

We simplify the right factor using (1.2) to obtain 

d t E N (t) = 2{Q(t) t - iQ(t) + Iwt - ilw, -Iw xx + Iw - G{Iw, Q(t))). 

^From (6.10), (6.11) we have 

Q(t) t - iQ(t) + Iw t - ilw = i9(t)(Q(t) - IQit)) - ± (*)9x(Q(t) - IQ{t)) 

+ Iiiw xx - iw + iG(w, Q(t))). 

To show (5.1), it thus suffices by (3.8), (6.15) to show the bounds 
rto+S 

a\(Qit) - IQit), -Iw xx +Iw- G(Iw, Q(t)))| dt < N^+d 2 

(6.16) 

rto+S 

v\{d x iQit) - IQit)), -Iw xx +Iw- G(Iw, Qit)))\ dt < N- 1+ a 2 
7 *o (6.17) 

rto+S 

(Iiiw xx -iw + iG{w, Q(t))), -Iw xx +Iw- G(Iw, Qit))) dt\ < N- 1+ & 2 . 

(6.18) 



to 



/* 

J to 



Jtn 
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To prove (6.16), (6.17) we use integration by parts to move all the derivatives 
onto Q{t) — IQ(t). This function has a norm of iV~ 100 in any reasonable space. By 
(6.14) we thus see that these terms are acceptable. 

We now prove (6.18). Again we could do this by direct computation, but we 
shall instead just use the work that we have already done in previous sections. By 
(5.5) it suffices to show 

rto+S 

/ (Uw xx -iIw+iIG(w,Q(t)),-Iw xx +Iw-G(Iw,Q(t)))\ <N- 1+ \\Iw\\] L, 1/2+e . 

Jt„ [*o- 5 .*o+ 5 l 

The left-hand side consists of multilinear expressions of order between 2 and 6 in 
w. Thus (cf. Step 4 of the previous section) it suffices to prove the estimate 

rto+S 

| / (iIw xx -iIw + iIG(w,Q{t)),-Iw xx + Iw-G{Iw,Q(t)))\ < N~ 1+ 

J to 

for all spacetimc functions w in the unit ball of Xh^f 2 ^^ . Note that the parameter 

a has totally disappeared, and so we are now able to lose factors of O(N~ 10a ) as 
necessary. 

We now convert the above expression into one which can be dealt with by Lemma 
4.2. Fix w, and define the functions v(t) by 

v(t) :=w(t) + Q(t). 

^From (1.2) we have 

ilw xx — ilw + iIG(w, Q(t)) — I{iv xx — iv + iF(v)). 

Similarly we have 

-Iw xx +Iw- G(Iw, Q(t)) = -Iv xx + Iv- F(Iv) + B{t) 

where the error B(t) is given by 

B(t) := [F(Iw + IQ{t)) - F(Iw + Q(t))] - [IF(Q(t)) - F(Q(t))]. 

Recall that IQ{t) is within 0(N~ loa ) to Q(t) in H 1 norm (say), and similarly for 
IF(Q(t)) and F(Q(t)). Since Iw is also bounded in H 1 , we thus have 



\B(t)\\ H1 <N 



-100 



and so the contribution of B(t) is easily seen to be acceptable. Thus it remains to 
show that 

(■to+8 



1+ 



I / {I(iv xx +iv- iF(v), -Iv xx + Iv- F(Iv))\ < N- 

Jto 

^From integration by parts we have the identity 

(ilv, -Iv xx + Iv - F(Iv)) = 

so it suffices to show 

I r + \l(iv xx - iF(v), -Iv xx +Iv- F(Iv))\ < N- 1+ . 

J to 

But this is immediate from Lemma 4.2 and (4.13). This finishes the proof of (5.1), 
and thus Theorem 1.2 is (finally!) completely proved. 
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